Hierarchical Clustering Given Confidence Intervals of Metric Distances

نویسندگان

  • Weiyu Huang
  • Alejandro Ribeiro
چکیده

This paper considers metric spaces where distances between a pair of nodes are represented by distance intervals. The goal is to study methods for the determination of hierarchical clusters, i.e., a family of nested partitions indexed by a resolution parameter, induced from the given distance intervals of the metric spaces. Our construction of hierarchical clustering methods is based on defining admissible methods to be those methods that abide to the axioms of value – nodes in a metric space with two nodes are clustered together at the convex combination of the distance bounds between them – and transformation – when both distance bounds are reduced, the output may become more clustered but not less. Two admissible methods are constructed and are shown to provide universal upper and lower bounds in the space of admissible methods. Practical implications are explored by clustering moving points via snapshots and by clustering networks representing brain structural connectivity using the lower and upper bounds of the network distance. The proposed clustering methods succeed in identifying underlying clustering structures via the maximum and minimum distances in all snapshots, as well as in differentiating brain connectivity networks of patients from those of healthy controls.

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Clustering Confidence Sets

We propose a method for clustering a large set of observed objects with different noise levels based on their confidence set estimates rather than their point estimates. The minimal and maximal distances between confidence sets provide confidence intervals for the true distances between objects. The upper bounds of these confidence intervals are used to minimize the within clustering variabilit...

متن کامل

Unsupervised multidimensional hierarchical clustering

A method for multidimensional hierarchical clustering that is invariant to monotonic transformations of the distance metric is presented. The method derives a tree of clusters organized according to the homogeneity of intracluster and interpoint distances. Higher levels correspond to coarser clusters. At any level the method can detect clusters of different densities, shapes and sizes. The numb...

متن کامل

Clustering of Musical Sounds using

This paper describes a hierarchical clustering of musical signals based on information derived from spectral and bispectral acoustic distortion measures. This clustering reveals the ultra metric structure that exists in the set of sounds, with a clear interpretation of the distances between the sounds as the statistical divergence between the sound models. Spectral, bispectral and combined clus...

متن کامل

Generalising Ward’s Method for Use with Manhattan Distances

The claim that Ward's linkage algorithm in hierarchical clustering is limited to use with Euclidean distances is investigated. In this paper, Ward's clustering algorithm is generalised to use with l1 norm or Manhattan distances. We argue that the generalisation of Ward's linkage method to incorporate Manhattan distances is theoretically sound and provide an example of where this method outperfo...

متن کامل

Several remarks on the metric space of genetic codes

A genetic code, the mapping from trinucleotide codons to amino acids, can be viewed as a partition on the set of 64 codons. A small set of non-standard genetic codes is known, and these codes can be mathematically compared by their partitions of the codon set. To measure distances between set partitions, this study defines a parameterised family of metric functions that includes Shannon entropy...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

عنوان ژورنال:
  • CoRR

دوره abs/1610.04274  شماره 

صفحات  -

تاریخ انتشار 2016